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Optimized detection of shear peaks in weak lensing maps 
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ABSTRACT 

We present a new method to extract cosmological constraints from weak lensing (WL) 
peak counts, which we denote as 'the hierarchical algorithm'. The idea of this method 
is to combine information from WL maps sequentially smoothed with a series of filters 
of different size, from the largest down to the smallest, thus increasing the cosmologi- 
cal sensitivity of the resulting peak function. We compare the cosmological constraints 
resulting from the peak abundance measured in this way and the abundance obtained 
by using a filter of fixed size, which is the standard practice in WL peak studies. For 
this purpose, we employ a large set of WL maps generated by ray-tracing through 
TV-body simulations, and the Fisher matrix formalism. We find that if \ow-S/Af peaks 
are included in the analysis {S/J\f ~ 3), the hierarchical method yields constraints 
significantly better than the single-sized filtering. For a large future survey such as 
Euclid or LSST, combined with information from a CMB experiment like Planck, the 
results for the hierarchical (single-sized) method are: Arig = 0.0039(0.004); Afini = 
0.002(0.0045); Aag = 0.003 (0.006); Aw = 0.019 (0.0525). This forecast is conserva- 
tive, as we assume no knowledge of the redshifts of the lenses, and consider a single 
broad bin for the redshifts of the sources. If only peaks with S/Af ^ 6 are considered, 
then there is little difference between the results of the two methods. We also examine 
the statistical properties of the hierarchical peak function: Its covariance matrix has 
off-diagonal terms for bins with S/J\f ^ 6 and aperture mass of M < 3 x lO^** /i~^A/q, 
the higher bins being largely uncorrelated and therefore well described by a Poisson 
distribution. 

Key words: Cosmology: theory ~ large-scale structure of Universe 



1 INTRODUCTION 

For more than a decade, weak gravitational lensing (WL) 
has been considered a powerful probe for testing cosmology 
due to its potential to map the 3D matter distribution of 
the Universe in an unbiased way, independent of baryonic 
matter tracers. 

Several surveys have already demonstrated the abil- 
ity of WL to constrain the cosmological model through 
cosmic shear measurements, e.g. the Cerro Tololo Inter- 
Ame ri can O bservatory (CTIO) lensing survey (|jarvis et al.l 
2001 I2OO6I), the Garchi ng-Bonn deep survey (GaBoDS, 
Hetterscheidt et al.ll2007l ). and the Canada- France-Hawaii 



Telescope Legacy Sur vey (CFHTLS, Hockstra ct al. 2006; 
ISemboloni et al.ll2006l '). 

Among the WL probes are shear peaks, regions of 
high signal-to-noise (S/M) in shear maps that can be pro- 



duced by individual clusters or by the alignment of several 
smaller objects on the line of sight. The abundance of shear 
peak s is as sensitive to cosmology as the cluster mass func- 
tion dMarian et afllioogl . I2OI0I : iKratochvil et al.ll2010l . I2OIII : 
lYang et al.ll201ll ). Clusters are on e of the four most pro mis- 
ing tools to measure dark energy (|Albrecht et al.ir2006 !). to- 
gether with supernova surveys, baryonic acoustic oscilla- 
tions, and WL surveys. Therefore, the WL peak function 
is equally promising in principle. There are two major ad- 
vantages of WL peaks over clusters: (i) WL peaks will come 
for free with any future lensing survey; (ii) one can very 
reliably calibrate the abundance of peaks with cold dark 
matter (CDM) simulations and proceed with direct com- 
parisons to data measurements, thus bypassing the thorny 
issue of the mass-observable relation. Shear peak signals 
need not be translated into virial masses in order to be able 
to extract cosmological i nformation from their abundance 
(IDietrich fc Hartladl2O10l ). A disadvantage is the absence of 
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an analytical fra mework for the W L peaks (though see the 
recent work of .Maturi et al.|[2010l ). 

Detections of shear peaks in WL data are exempli- 



fied in the works of IPahlc ("2006'); ' Schirmer etlj] (|2007l ): 
iBerge et al] (|2008l 'l: [Abate ct al. (20091). However, the ex- 
ploration of the shear signal of clusters has been focused 
mostly on mass determinat ion, as for instance in t he recent 
work of lOkabe etahl (|2010D and llsrael et"al] (|2010D . 

Since the introduction of the aperture mass by 
ISchneideil (|l996l ), there have been many studies of fil- 
ters optimal for peak detection and of the impact of 
large-scale structure (LS S) projec t ions o n cluster mass 
reconstructions (e.g. Metzler et al.l 20011: Hoekstral 200 ll : 



Hamana et al . 2004; Clowe et all l2004j : iTang fc FanI l2005l: 



Maturi et al.l 



2005: iHen nawi fc Spergej 120051: iMarian et all 



20101 : iBecker fc Kravtso v 2011; Gruen et al.ir201ll '). The"g 



eral agreement is that cluster masses derived from WL mea- 
surements are affected by both correlated and uncorrelated 
LSS projections, as well as by departures of the density 
profiles of real clusters from the assumed spherical models. 
These effects cause scatter and bias in the predicted and 
measured S/Af of the clusters. Nonetheless WL mass recon- 
structions retain the attractive feature of being able to rely 
on numerical simulations for accurate predictions of such 
biases. 

In this paper we address a more general ques- 
tion related to the abundance of shear peaks. Given 
the upcoming WL survey s su ch as the Kilo-Degree Sur- 
vey (KiDS , iKu iikcn 201ol), the Dark Energy Sur - 
vey (DES, iThe Dark Energv Survey Collaborationl l2005h , 
the Large Synoptic Survey Telescope (LSST) survey 
JLSST Science Collaborations et aL 2009), or the EucUd 
survev jThe Euclid CoUaborationf 20111 ). it will be possible 
to measure the shear peak function: what is the optimal way 
to do this? 

The standard approach to peak detection is: A given 
shear map is smoothed with a given filter function; in the 
smoothed map, one looks for points of local maximum which 
have S/Af higher than a certain chosen threshold value, and 
one selects these points as 'peaks'. This procedure is depen- 
dent on the filter used. There have been many studies on 
the shape of filters that maximize the S/J\f assuming certain 
shapes for the peak signal and various types of noise such 
as shape noise or projec t ion noise (e.g. [Hennawi fc Spergell 
l2005l : iMaturi et al1l2005l : iGruen et al.ll201ll ). 



Less attention has been directed towards the size of fil- 
ters. Indeed in most studies, the peak abundances are mea- 
sured using a single-size d filter (e.g. iHamana et al. 1 12004 
iDietrich fc Hartladlioiol) , though it is clear that each size 
will lead to a different p eak function . For a different ap- 
proach using wavelets, see iPires et all (|2009l ). Here we pro- 
pose a method that we call 'hierarchical algorithm': A shear 
map is smoothed with several filters of the same shape but 
different size, from the largest to the smallest. We show 
that by taking into account the extended information from 
such multi-scale filtering, one can as sign in the con t ext o f 
an assumed halo paradigm, e.g. the iNavarro et al.l (| 19971 ) 
model, a unique S /N and (redshift-dependent) mass to the 
detected peaks. We use the Fisher matrix formalism and a 
large set of simulated WL maps to show that the cosmolog- 
ical constraints derived from the hierarchical peak function 



are much improved compared to those obtained using a filter 
of the same shape but only one size. 

The paper is structured as follows. In section jj^] we 
present the A'^-body simulations and the ray-tracing per- 
formed to generate the WL maps employed in this study. 
In 33] we explain the hierarchical scheme and the filter that 
we adopt. The results of this work are presented in ^ along 
with a WL-peaks Fisher forecast for surveys like Euclid and 
LSST, the first to be obtained from simulation measure- 
ments. In iJllwe summarize and conclude. 



2 NUMERICAL SIMULATIONS AND 
RAY-TRACING 

We generated WL maps from ray-tracing through A'^-body 
simulations. We used 8 simulations which are part of a larger 
suite performed on the zBOX-2 and zBOX-3 supercomput- 
ers at the University of Zurich. For all realizations 11 snap- 
shots were output between redshifts z = [0, 2]; further snap- 
shots were at redshifts z = {3, 4, 5}. We shall refer to these 
simulations as the zHDRIZ D H sim ulations, and they were de- 
scribed in detail in lSmithI (|2009l ). 

Each of the zHORIZON simulations was performed using 
the publicly available Gadget-2 code (Springel 200 5), and 
followed the nonlinear evolution under gravity of N = 750"^ 
equal-mass particles in a comoving cube of length Lsim = 
1500 /i^^Mpc; the softening length was IsoH = 60 /i~^kpc. 
The cosmological model was similar to that d etermined by 
the WMAP experiment (|Komatsu et al.ll2009l ). We refer to 
this cosmology as the fiducial model. The transfer function 
for the simulations was gene rated using the pu blicly avail- 
able cmbfast code (Seliak fc Zaldarriagj [19961 ), with high 
sampling of the spatial frequencies on large scales. Initial 
conditions were set at redshift z = 50 u sing the serial ver- 
sion of the publicly available 2LPT code (jScoccimarr ol ll998l : 
ICrocce et al.l[200a) . Table [T] summarizes the cosmological 
parameters that we simulated and Table [2] summarizes the 
numerical parameters used. 

For the Fisher matrix study of peak counts, we em- 
ployed another series of simulations. Each of the new set 
was identical in every way to the fiducial model, except that 
we have varied one of the cosmological parameters by a small 
amount. For each new set we have generated 4 simulations, 
matching the random realization of the initial Gaussian field 
with the corresponding one from the fiducial model. The 
four parameter variations were: {n —>■ (0.95,1.05), as 
(0.7,0.9), fim ^ (0.2,0.3), w (-1.2,-0.8)}, and we refer 
to each of the sets as zHORIZON-Vla.b,. . . ,zH0RIZ0N-V4a,b, 
respectively. The details are summarized in Tables [1] and (2] 
For the WL simulations, we considered a survey simi- 
lar to Euclid (iThe Euclid Collabor ation 2011*) and to LSST 
(|LSST Science Collaborations et al.. i2009,), with: an rms 
= 0.3 for the intrinsic image ellipticity, a source num- 
and a redshift distribution of 



ber density n = 40arcmin 
source galaxies given by: 

V{z) = Af{zo, P) z^ exp[-(z/2o)''] 



(1) 



where the normalization constant M insures that the in- 
tegral of the source distribution over the source redshift 
interval is unity. If this interval extended to infinity, then 
the normalization could be written analytically as: M = 
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Table 1. zHORIZON cosmological parameters. Columns are: density parameters for matter, dark energy and baryons; the equation of state 
parameter for the dark energy; normalization and primordial spectral index of the power spectrum; dimensionless Hubble parameter. 



Cosmological parameters 








w 




n 


J^o[kms-iMpc-l] 


zHORIZON-I 


0.25 


0.75 


0.04 


-1 


0.8 


1.0 


70.0 


zHORIZON-Vla/Vlb 


0.25 


0.75 


0.04 


-1 


0.8 


0.95/1.05 


70.0 


zH0RIZ0N-V2a/V2b 


0.25 


0.75 


0.04 


-1 


0.7/0.9 


1.0 


70.0 


zH0RIZ0N-V3a/V3b 


0.2/0.3 


0.8/0.7 


0.04 


-1 


0.8 


1.0 


70.0 


zH0RIZ0N-V4a/V4b 


0.25 


0.75 


0.04 


-1.2/-0.8 


0.8 


1.0 


70.0 



Table 2. zHDRIZDN numerical parameters. Columns are: number of particles, box size, particle mass, force softening, number of realiza- 
tions, and total simulated volume. 



Simulation Parameters 


-/Vpart 


Lsim [/i-^Mpc] 


mp[h-^MQ\ 


hoit[h ^kpc] 


-^cnscmb 


Vtoi[h-^ Gpc3] 


zHDRIZDN-I 


750^ 


1500 


5.55 X 10" 


60 


8 


27 


zHORIZON-Vl, -V2, -V4 


750^ 


1500 


5.55 X 10" 


60 


4 


13.5 


zH0RIZ0N-V3a 


750^ 


1500 


4.44 X 10" 


60 


4 


13.5 


zH0RIZ0N-V3b 


750^ 


1500 


6.66 X 10" 


60 


4 


13.5 



3/(2:0 r[(3 + /3)//3]). There is a small difference between this 
value and what we actually used, due to the fact that we 
considered a source interval of [0,3]. We took /3 = 1.5, 
and required that the median redshift of this distribution 
be Zmcd = 0.9, which fixed 20 0.64, and gave a mean of 

^mean 0.95. 

From each A''-body simulation we generated 16 indepen- 
dent fields of view. Each field had an area of 12 x 12 deg^ 
and was tiled by 4096^ pixels, yielding an angular resolu- 
tion 0pix = lOarcsec. For each variational model, the to- 
tal area was of x 9000 deg^, while for the fiducial model it 
was of * 18000 deg^. The effective convergence k in each 
pixel was calculated by tracing a light ray back through the 
simulation with a multipl e-lens-plane ray-tracing algorithm 
(IHilbert et al. I l2007l , l2009l '). Gaussian shape noise with vari- 
ance a^/(n0p;x) was then added to each pixel, creating a 
realistic noi se level and correla tion in the filtered conver- 
gence field (|Hilbert et al.|[2"007h . We keep the shape noise 
configuration fixed for each field in different cosmologies, in 
order to minimize its impact on the comparisons of the peak 
abundances measured for each cosmology. 



3 SMOOTHING WEAK LENSING MAPS 
3.1 A matched filter 

To find peaks in W L maps, we smooth the latter with a n 
aperture- mass filter (|Schneideill 19961 : [Schneider et al.ll 19981 ). 

The smoothed convergence map is a convolution between 
the filter function and the fi:/7 field of our simulations: 

Map(6»o) = Jd"6( (/(6»o-6») «;(6>) = Jd'e Q(6»o-6>) 7(0), (2) 

where k, is the convergence, 7 is the tangential shear field, 
and U and Q are aperture filters for convergence and shear 
respectively. 0o is an arbitrary point. Aperture mass filters 
are compensated, which for a spherically symmetric function 
can be expressed through the equation: 



f 

Jo 



Aeeu{e) = o. 



(3) 



where 6a is the compensation radius. In the presence of the 
ellipticity noise of the source galaxies, it can be shown that 
an optimal and compensated filter is given by 



u(e) = c ■ 



?/n 



(4) 



where C is an arbitrary normalization constant, Km is the 
adopted model for the convergence profile of peaks, i.e. NFW 
or similar, and a^/n is the shape noise variance per ellip- 
ticity component. The mean convergence inside a radius 9 
is defined by k(6') = 2/9^ d9' 9' K{e'). The analogue filter 
function for the shear field is given by 

- lm{9) 



Q{e) = c- 



(5) 



where 7in is the assumed tangential shear model of the 
peaks. Under the assumption that the shape noise is the 
dominant source of noise in the measurements, this filter is 
optimal because it maximizes the signal-to-noise (S/M) at 
the location of a peak with the convergence/shear profile 
«:m/7m. From Eqs ([2]) and Q, the S/M can be written as 



or in terms of the shear 

IT id'e7mW 7(00-0) 



S/M{Oq) 



(6) 



(7) 



Note that the S/^f does not depend on the arbitrary nor- 
malization constant C. 

Although we do not make a comparison between peaks 
and clusters, and indeed do not use any information on the 
simulation halos in this study, our choice of C provides in- 
sight into the halos that generate the peaks (though of course 
not all the peaks will correspond to a halo). If Oq denotes 
the location of a peak formed by a halo of mass Mm, redshift 
Zm, and profile k{9) = Kni{9; Mm, Zm), then we require that 
the amplitude of the smoothed map at the location of this 
pea k be exactly Mm'- Afap(0o) = Mm- In this case, C is given 
by l|Marian fc Bernsteinll200g) : 
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m 



Figure 1. Matching between the filter and peak profiles in the 
hierarchical method. The larger filter yields an aperture mass 
while the next filter in the sequence, of smaller size 



ap 1 

AI^^ , gives an aperture mass Mlp^ 



> M, 



The solution M 



to Eq. II12I I is found through interpolation to be Eq. II13I I. We take 
this to be the 'true' mass of the peak. 



C(M,„) = a;/n 



(8) 



In the above equation, Km(^) = Km(6; Mm, Zm) and it is 
assumed that the radial integral has an upper limit of 8a- 
The latter applies also to Eqs ©, 0, ITO)). Note that for 
the shear filter the normalization is the same, since 



2n 



Jo 



f 

Jo 



Our analysis was performed on convergence maps, and so in 
the following, we shall focus on the latter. Inserting Eqs Q 
and ([H]) into Eq. we write down the amplitude of the 
smoothed map for our particular choice of matched aperture 
filter: 



Map(6»o) 



Mr 



I d^e [^^(e) - Km(6>A)]«:(6>o - 6>) 



= VC(Afm) Mm 5/7V(6>o) 



(10) 



Within the validity bounds of our model, i.e. the peak is 
indeed generated by an NFW halo of that mass and redshift, 
Eq. (|10|l represents an unbiased estimator for mass. 

Finally, we assume a relation between the model mass 
Mm and the compensation radius: We take the latter to be 
the angular scale subtended by the virial radius of a halo 
with mass Mm, redshift Zm, and convergence profile Km'- 



)/DA{Zn 



(11) 



where DA{zm) is the angular diameter distance to Zm- This 
choice enables us to connect the 'size' of the filter, i.e. the 
aperture radius, with the 'mass' of the filter Mm, using the 
standard relation between mass and virial radius provided 
by models of structure formation, such as NFW for instance. 
Given the source distribution in Eq. IT]), we take Zm = 0.3. 
This is just a clarification of what we mean by size and mass 
of the filter, all the necessary details will be provided in i 



3.2 Hierarchical Filtering 

We shall now describe our method to detect shear peaks and 
assign them masses an d S / Af. T his was already implemented 
in our previous works [Marian et al.l (j2009l . I2010I . I20l1[ ). 

As mentioned in ij3.1l smoothed maps are obtained by 
convolving the convergence field with a filter, e.g. Eq. 0. 
Peaks are detected as local maxima in the smoothed maps, 
i.e. points with amplitude higher than that of their 8 neigh- 
bors, where the amplitude is given by Eq. ([5]), or in our par- 
ticular case, by Eq. (jlOp . Medium or large peaks will still 
be local maxima even when smoothed with filters of size 
much larger or smaller than the peak radius. But the S/Af 
and amplitude associated to the peak will be quite different 
for a range of filter sizes spanning 1-2 orders of magnitude. 
Therefore, there is some degree of arbitrariness when try- 
ing to classify the abundance of WL peaks in terms of their 
S/Af or amplitude: the answer will depend on the filter size. 
Also, if the peaks are small, then a large filter may ren- 
der them quite indistinguishable from spurious shape-noise 
peaks. The strengths of our top-down approach are: 

(i) It uses filters of several sizes, which will increase the 
scale range of the detected peaks and therefore the cosmo- 
logical information of the peak counts. 

(ii) It uses an interpolation scheme for the results of the 
smoothing with each filter to accurately establish a unique 
value for the S/Af and amplitude A/ap of each peak. Thus 
the ambiguity of classifying peaks in terms of their S/J^ 
is removed, and one obtains a 'general' peak function, as 
opposed to a different peak function for each filter size em- 
ployed in smoothing. 

The concrete steps that we take are as follows. We 
smooth the maps with a sequence of filters of different sizes 
(masses), in a hierarchical fashion, from the largest to the 
smallest size. The purpose is to determine the 'mass' of the 
peaks, i.e. the filter size which matches best the size of the 
peaks: 



Map(6>o) = M„ 



(12) 



where Go denotes the location of a detected peak. For each 
filter i in the sequence, the peaks are selected so that: (a) 
Mip ML; (b) S/Af' ^ {S/Af)min, where M^, is the size of 
the filter. Af^p and S/Af^ are the aperture mass and S/N 
defined in Eqs ((6| and (|10|) corresponding to this particular 
filter, and we choose (5/A/')min = 3 as a detection threshold. 

Equation (|12|) is not likely to be satisfied by any par- 
ticular filter in the sequence, hence we find its solution by 
interpolating between the results of smoothing with differ- 
ent filters in the sequence. This is illustrated in Figure [1] 
Suppose there is a peak of true mass M and S/M (true 
according to the assumed model). Then there will be two 
consecutive filters in the sequence, i and i + 1, for which the 
following relations are true: 



S/M' > S/M > S/M' 



Ml > Afip > M > m:;' > Mi:\ 

Since the aperture mass of the peak obtained from consec- 
utive filters varies gently with the size of the filter, we can 
use linear interpolation to write down the solution for the 
true mass, i.e. the solution of Eq. (|12|l . 



M 



M* M' 



Mi^H'Il 



Mi, ~ Mi 



Ml 



■ Ail+i ■ 



(13) 
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Once the true mass is determined, we use Eqs (|10|l and ([121 
to determine the S/M of the peak: 



S/M = ^M/C{M) 



(14) 



Given our two selection criteria, in the above example the 
point of local maximum will be selected as a peak by the 
filter i + 1, but not by its predecessor i, if the signal to noise 
will also be above the detection threshold. It will also appear 
as a peak in the maps smoothed with filters < M,^^, pro- 
vided that the same S/^/ requirement is fulfilled. In order to 
be able to carry out the interpolation scheme, for each filter 
used we record the aperture mass, S/M, and the 2D location 
of the peaks. A peak might slightly change its coordinates 
in maps smoothed with different-sized filters; we take this 
into account, and allow for variations of up to 4 pixels in 
the X, y directions of the map (a pixel has 6'pix = 10 arcsec). 
We also record those points of maximum where the aper- 
ture mass is smaller, but not much smaller than the mass 
of the respective filter; to be specific, the points obeying the 
condition: 0.6M^ < M^p < M^. These points we call 'pseu- 
dopeaks' and they are likely to be selected as peaks by the 
next filter in the sequence; therefore, they are useful for the 
interpolation that we perform later. The value 0.6 is of no 
particular significance, it is suitable for the logarithmically- 
spaced sequence of filters that we apply, based on several 
trials. 

Finally, the processing of the peaks resulting from the 
smoothing with the hierarchical sequence of filters, consists 
of the following steps: 1. We exclude from the maps those 
peaks already selected by a larger filter; 2. We apply the in- 
terpolation scheme to assign the remaining peaks a unique 
aperture mass, according to Eq. (|13|l . We then use this mass 
to compute the S/M value, according to Eq. (|14p : 3. We 
exclude those peaks that are within the virial radius of a 
larger peak, since in many of such cases, the second peak is 
just an artifact of the smoothing or we simply deal with a 
very clumpy halo that is split by the smoothing into a large 
peak and some small ones. We thus remove the problem of 
'peaks-in-peaks' and also do not count substructures as in- 
dependent halos. This is done for the purpose of obtaining 
a 'clean' peak function, but ultimately such events concern 
only small peaks and we have checked that the cosmologi- 
cal constraints derived from the counts are not significantly 
altered by these exclusions. 



3.3 Model specifications 

We base our halo model on the NFW density profile: 

2" 



PnFw(''') = p5c 



^ 1+^ 



(15) 



where p is the mean matter density of the Universe, 5c 
is the characteristic overdensity, and Vs is the scale ra- 
dius. We adopt the Sh eth-Tormen (ST) definition of mass 
l|Sheth fc Tormenlll999l ): Mvir = 47r7£5i^Avirp/3, i.e. we use 
the mean matter density to define the overdensity for halo 
formation, as opposed to the critical density, pcrit. The two 
are related by p = f^mPcrit. Avir = 200 for ST and NFW. 
Integrating Eq. p5|l to obtain the virial mass and using the 
above definition for the latter, one arrives at the following 
expression for the characteristic overdensity: 



Sc = 



log(l +c)- c/(l + c) ' 



(16) 



where the concentration parameter is defined by c = Rvh/rs- 
In the ACDM model, ST and NFW halos have the same 
density profile, but ST halos have larger cut-off radii and 
concentration parameters than NFW ones. For the concen- 
tration parameter we employed the numerical prescription 
of Gao et al. (2008), whilst to translate NFW t o ST param- 
eters, we used the approach of lSmith fc Watt3 ({20051 ') . 

We use the truncated convergence profile resulting from 
this profile, i.e. we limit the projection of the 3D density 
along the line of sight to a region delimited by the virial 
radius: 



^crit J-^fl2_^-r2 ^ 

with Ecrit being the critical surface density for lensing. The 
above equation can be rewritten as 



(17) 



2rs 5c p , . 
Knfv/(x) = — f(x), 



(18) 



where x = r±/rs is adimensional, and the function / de- 
pend s on cosmology only through the concentration param- 
eter (|Hamana et al.ll2004h : 



2^1/2 



cosh 



-1 / x^ + c 



x{l + c) 



(l-x-2)(l + c) " (l-a;2)3/2 
(c^-l)i/2 



X < 1 



3(1 + c) V^ + T-fc 



2x1/2 



, X = 1 
-1 ( x^ + c 



(19) 



x{l + c) 



(1 - x'){l+c) (t^ - Ifl^ 

0, x > c. 



X > 1 



The model that we assume for the k of halos is a convolution 
of the NFW convergence profile defined by Eq. (|19|) with a 
two-dimensional (2D) Gaussian function with the width of 
the order of the softening length of the simulations: 



Kn.(6>) = J 



(20) 



The above equation can be rewritten as 



Kn.(6>) = 



exp 



JO 



de' 



\ exp 



2al 



(21) 



where oq is the width of the Gaussian function, and Jo is 
the modified Bessel function of order 0. The dependence 
on the lens redshift is implicit for both Knfw and CTq. This 
model choice accounts for the finite resolution of the numer- 
ical simulations. The convolution in Eq. (|21|l has a similar 
effect to 'coring' the convergence profile, i.e. making it flat in 
the centre of the cluster, where the WL regime breaks down 
and measurements are very difficult to obtain. For numerical 
simulations, cored profile models are desirable because one 
cannot resolve structures below the softening length. Lastly, 
Eq. (|21|l alleviates uncertainties in the location of the centre 
of the peak, which could lead to large discrepancies between 
measured and theoretical profiles, if the latter have a cusp 
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Figure 2. Comparison between measured and tlieoretical convergence profiles for tlie unsmoothed fiducial cosmology maps, with ttie 
sources at redshift 1. Tlie red points represent the average of the measured profiles around the centres of the peaks detected with the 
hierarchical method of i|3.2l The peaks have been binned according to the mass assigned through that method, and the panels depict 
different mass bins. The blue solid lines represent the theoretical profile from Eq. l|21|l . for the mean mass of the bin. The redshift of the 
filter is set at 0.3, and the error bars are on the mean of 128 fields. The maps are free of shape noise. 



at the centre, e.g. like NFW. Therefore, we take the width of 
the Gaussian present in the convolution to be ctg = Q^soft, 
where hoit is the softening length of the simulations, a = 2 
for ST halos with M ^ 7 x lO^"*/i"^M0, and a = 1.5 for 
M < 7 X lO^/i^^M©. For the redshift Zm = 0.3 that we 
assume for our filter, ctg = 22 (29) arcsec respectively. 

Figure [2] shows the comparison of the theoretical and 
measured profiles for the unfiltered convergence maps of the 
fiducial model corresponding to sources at redshift 1, in the 
absence of shape noise. We use the hierarchical method to 
assign masses to peaks. The peaks are binned according to 



the assigned mass, and the coordinates of their centres are 
used to measure the shear and convergence profiles. Each 
panel in the figure corresponds to a mass bin. The red points 
depict the average of the measured convergence profiles of 
the detected peaks. The error bars correspond to errors on 
the mean of the 128 fiducial fields. The solid blue lines rep- 
resent the theoretical profile of Eq. pip , estimated for the 
mean mass of the peaks in the bin and the redshift Zm = 0.3, 
i.e. the optimal redshift for lensing for sources at redshift 
1. The agreement between the model and measurements is 
remarkable, given the fact that some peaks correspond to 
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Figure 3. The number of peaks per unit S/Af for an area 
of 144 deg^. Different symbols and colors denote different fil- 
ter sizes: red circles correspond to the hierarchical filtering, 
and blue squares/purple stars/green triangles to the M = 
2 X 10^5, 5 X lO^*, W^^h-'^MQ filters respectively. The redshift 
of the filter is kept fixed throughout the analysis at Zm = 0.3. The 
results are the mean of the functions measured from 128 fields of 
the fiducial cosmology, and the error bars are on the mean. 

halos at different redshifts or to no halos at all, and the fact 
that we assume a spherical density model, which is bound 
to fail for peaks arising from aspherical halos, or to be af- 
fected by projection effects. Despite these limitations, the 
hierarchical method classifies peaks efficiently on average, 
as shown in the figure. Note that the bottom panels of Fig- 
ure [2] present an oscillatory feature around the virial radius 
of the profiles. We generated convergence maps of synthetic, 
perfect NFW halos, and checked that such features can ap- 
pear if noise is added to the maps. This owes to the fact that 
the compensated filter prefers to select peaks which have re- 
gions of low convergence around the virial radius. This effect 
is more pronounced for smaller peaks because these most 
likely correspond to small halos, which have increased par- 
ticle shot noise. We have measured the convergence profiles 
of the friends-of-friends (FoF) halos of the simulations at 
redshift 0.3, and did not find such features as seen in Figure 
[21 from which we conclude that they are caused by the filter 
selection. 

We use the above-mentioned values for a in Eq. H21|) to 
obtain Figure [51 based on several trials, we find these values 
to yield the closest resemblance between the measured and 
theoretical profiles. We do this test in the absence of shape 
noise, since we are trying to address a technical issue arising 
from our numerical simulations: the impact of the softening 
length. In the absence of shape noise, the algorithm in i]3.2l 
can be applied by formally setting cr^/n ^ 1 in Eq. Q , and 
using only the mass criterion to select peaks; Eq. (|10|) does 
not change. 

The hierarchical method could be useful for determin- 
ing cluster masses from WL profiles. In order to increase the 
accuracy of the results, one should perform a careful analy- 



sis of: the chosen filter and its parameters, the compensation 
radius Oa, the inclusion of the halo-matter cross-correlation 
term visible in Figure [51 the impact of shape noise and pro- 
jection noise, the impact of photometric redshifts errors of 
the source galaxies. However, this is beyond the goals of the 
present study. 



4 RESULTS 

We present a comparison between peak statistics results ob- 
tained through the hierarchical algorithm described in 331 
and from applying three single-sized filters of different size. 
When using the single-sized filters, we keep the same filter 
function as given in section 331 as this work is not concerned 
with assessing the performance of filters of different shape. 
In this case, we simply select the peaks by requiring that 
their S/J\f > (<S/A/')min, with the S/M given by Eq. ©. The 
three sizes that we consider correspond at redshift Zm = 0.3 
to the masses {2 x 10^^ 5.5 x lO", lO"} /i^^Mq, with the 
angular size of the virial radii given by {13.2, 8.6, 4.8} ar- 
cmin, respectively. Note that due to the fact that ST ha- 
los have larger radii than NFW ones of the same mass, 
these angular sizes are also slightly larger than NFW an- 
gular sizes. For the hierarchical filter we consider a se- 
ries of 12 filters, logarithmically spanning the mass inter- 
val [8.85 X 10^^2 X W^^]h-^MQ, and with S/Af > 2.6. 
Ultimately, throughout the entire analysis for the fixed-size 
and hierarchical methods, we shall use only peaks above 
the threshold {S/J^)inin = 2.8, but going to lower val- 
ues ensures the completeness of the sample of hierarchical 
peaks. The shape noise contamination makes it difficult to 
consider smaller filters. We bin the resulting peak abun- 
dances in terms of S/Af, logarithmically spanning an interval 
[2.8,14]. For reasons discussed in Appendix we choose 
-^bin = 20. Note that for the hierarchical abundance it is 
useful to also consider binning in mass, as assigned through 
Eqs pop. (|12p . p3p . This allows to draw analogies between 
the properties of the WL peaks and those of 3D haloes. Here 
too we use 20 bins spanning [lO'^*, 2 x lO'^^] H'^Mq, the lower 
bound roughly corresponding to the (iS/7V)min = 2.8 for the 
analyzed cosmological models. 

Comparing peak abundances obtained with different 
methods is not necessarily relevant: the results will be clearly 
different, and it would be hard to decide which filter size is 
more effective. This is shown in Figure [3] where we present 
the peak functions corresponding to the three single-sized 
filters, as well as the hierarchical method. The functions are 
expressed as number of peaks per unit S/J\f for an area of 
144 deg^, and the results are an average of the peak func- 
tions measured in the 128 fields of the fiducial model. The 
error bars correspond to errors on the mean. As expected, 
each single-sized filter favors the detection of peaks with 
iS/A/" in accord to its size: the smallest filter peak abundance 
is mostly formed by low S/Af peaks, and similarly for the 
medium and large filters. The hierarchical peak abundance 
is similar to the largest-filter abundance for the high-S/Af 
bins, and to the smallest-filter abundance for the low-S/Af 
bins. We shall next explore how the measured abundances 
translate into cosmological constraints. 

To this effect, we shall resort to the Fisher matrix for- 
malism, with four clear goals: 
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Oh 




Figure 4. Derivatives of the peak abundances with respect to the cosmological parameters considered, as a function of S/Af. The red 
solid circles are measurements with the hierarchical method, and the blue solid squares correspond to smoothing with a filter of fixed 
size M = 2 X (angular size of the radius 13.2 arcmin for = 0.3). The derivatives are estimated in accord with Eq. 13111 . 

and are divided by the mean counts of the fiducial model. We show the errors on the mean of 64 fields. 



(i) to provide a comparison between the filtering meth- 
ods; 

(ii) to test which range of mass or S/Af contributes most 
to the constraints derived from WL peak counts; 

(iii) to test the difference between the errors obtained by 
using the full covariance matrix of counts, and the Poisson 
errors; 

(iv) to provide a realistic forecast for surveys like LSST 
and Euclid, in a very direct manner, based on simulation 
measurements . 



4.1 Fisher matrix considerations 

Using the measured peak abundances, we compute the 
Fisher information following the standard definition: 



ln£ 

dpadpb 



(22) 



where Pa and Pb are elements of the cosmological model 
parameter set p upon which the likelihood £ depends. In 
our case the set is: {n, as, flm, w}. We assume a Gaussian 
likelihood 

£(m|m(p),C(p)) ^ 



(27r)^b.„/2|C|i/2 
1 , 



X exp 



-(m 



1 / 
(m 



(23) 



where m is the vector of peak counts, and m is the vector 
of mean number of peaks; both vectors have the dimension 



A^bin, i.e. the number of bins considered. The covariance ma- 
trix of the counts in bins i and j is 



dj = {{nii - nii) {nij - mj)>. 



(24) 



From the Fisher matrix, one may obtain an estimate of the 
marginalized errors and covariances of the parameters: 

'^PaPb ~ ]paPb' (25) 

as well as the unmarginalized errors: 

O-Pa = [J^PaPaV^'^- (26) 

The size of the errors quantifies the efficiency of the filtering 
method to extract cosmological information from WL peak 
counts. For sim plicity, we shall ignor e the trace term in the 
Fisher matrix, (|Tegmark et al.|[l997l ). In this case, Eq. 
can be rewritten as 



_ sr^ drm „-i dmj 
-^p^v^- h ^r.. ' 



dpb 



(27) 



We are also interested in the Poisson errors of the peak 
counts, since the Poisson statistic is widely adopted in fore- 
casting cosmological constraints from WL peak counts. They 
are given by 

P ^ dfhi dfhi 1 



8pa dpb rui 



The mean number of counts for bin i is estimated as 



1 ^ 

- y 



mi = — y , m. 



(28) 



(29) 
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Figure 5. Fractional marginalized Fisher matrix errors based on measurements from simulations. The symbols and colors arc the same 
as in Figure |3] The errors are cumulative: starting from the highest bins, we gradually allow the rest of the bins to contribute to the 
constraints. The central values of the S/M bins are indicated on the ic-axis. 



In the above / designates the field number, while A'' is the 
total number of fields; for the fiducial cosmology, A'^ = 128, 
and for the variational cosmologies = 64. An unbiased, 
maximum-likelihood estimator for the covariance matrix is: 

1 ^ 

The derivatives of the counts with respect to the cosmolog- 
ical parameters are calculated from 

drht _ _j_ Y rn{{pa + Apa) - m{{pa - Apa) 

dpa ~ N 2Apa ' ^ ' 

where Apa represents the ± step in the cosmological param- 
eters, e.g. Table [T] 

We estimate the Fisher matrix errors using the covari- 
ance on the mean for the counts of the fiducial model; 
the rescaled covariance matrix corresponds to an area of 
* 18000 deg^. Together with the survey specifications given 
in section ij2] this makes our study representative for two 
future surveys, LSST and Euclid. 

4.2 Comparison of filtering methods 

Figure U depicts the derivatives of the measured peak abun- 
dances with respect to the four cosmological parameters that 
we consider, as a function of S/M. We show results for the 
hierarchical method and the largest of the single-sized fil- 
ters, M = 2 X IQ^^h'^ Mq. The derivatives are estimated 
using Eq. (|31|l , and the result is divided by the mean counts 



of the fiducial cosmology. The figure shows that both filter- 
ing methods yield peak functions similarly sensitive to cos- 
mology, with the hierarchical derivatives displaying slightly 
more features than the single-sized ones. For most of the 
S/M range considered, the peak-function derivatives are 
non-zero, signifying that there is cosmological information 
in the high-iS/A/" pea ks, as well as in the \ow-S /M ones, as 
previously noticed bv lDietrich fc Hartlad l|2010l ). This origi- 
nates in a similar behavio ur displayed by the halo mass func- 
tion: In a previous work (|Smith fc Marianll201ll ). we found 
the derivatives of the latter with respect to the same param- 
eters studied here to be non-zero for a large range of halo 
masses, down to M = 10^'^ h^^ Mq (compare Figures 6, 7 in 
that work with Figures [4] [5] in this work). 

Figure [5] depicts the marginalized Fisher errors for the 
four cosmological parameters that we consider. The errors 
are fractional, i.e. the error for each parameter is divided 
by the fiducial value of that parameter, and cumulative, i.e. 
we include S/M bins cumulatively, from the largest to the 
lowest. The central values of the bins are indicated on the 
X-axis. The symbols and colors are the same as in Figure |3l 
It is apparent that the hierarchical filtering performs bet- 
ter than the single-sized filtering, if one takes into account 
peaks with S/M ^ 4. The greatest improvement is for w: 
the hierarchical error is smaller by more than a factor of 2 
compared to the fixed-size filtering. A smaller improvement 
happens also in the case of erg and fim, while the error on Us 
seems unaffected by the filtering method, due to generally 
poor constraining power that peaks have on this parameter. 
Figure [5] reinforces the suggestion of Figure |4] that the in- 
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elusion of peaks with small S/J\f improves significantly the 
cosmological information. 

Note that for the hierarchical method we can also use 
aperture-mass bins to measure the peak function deriva- 
tives, the covariance matrix, and the Fisher matrix, with 
the mass given by Eq. (|13|l . We obtain similar results to 
those presented in Figures [4] [5] The single-sized filters per- 
form very similarly, the largest one being marginally better 
in the case of f2m, erg, and Us- Its diameter of 13.2 arcmin 
is lar ger than what previo us studies in the literature have 
used: Hamana et a l. (2004) had a 1-arcmin Gaussian filter, 
iHennawi fc Spergel (2005') employe d a ~ 5-arcmin NFW fil- 
ter, and Dietrich fc Hartlag (|201G| ) a 5.6-arcmin one. Since 
the constraints from filtering with fixed sizes are so similar, 
we shall only show the results from the marginally-better 
M = 2 X lO^^h-^MQ one. 

In Appendix ij^we examine the statistical properties of 
the peaks detected through the hierarchical and fixed-sized 
methods. We find that: 

• The Poisson statistic describes well the distribution of 
hierarchical high-S /J\f and high- mass peaks. The mass is 
defined by Eq. (|13p . We show that the correlation matrix of 
<S/A/'-binned hierarchical peaks has strong off-diagonal con- 
tributions for the small-5/7V bins, while being largely diag- 
onal for the large-iS/A/" bins. The same applies to the mass- 
binned correlation coefficient , and this behaviour is similar 
to that of halos, as shown in lSmith fc MarianI (|201ll 'l. 

• The high-iS/A/" single-sized peaks are also reasonably 
described by the Poisson distribution, due to the fact that 
such peaks are usually quite massive and rare. The corre- 
lation matrix of these peaks seems slightly more correlated 
for S/J\f ^ 7 than the hierarchical matrix. 

Lastly, in Appendix !jB]we investigate the dependence of the 
Fisher errors on the number of bins in which the S/Af inter- 
val is divided. Figure [Bll suggests that all filtering methods 
reach the expected saturation in information if A'^bin ^ 20, 
which is why we choose Abin = 20 for the results presented 
in this work. 



4.3 Forecasting constraints on cosmology 

We present a Fisher-matrix forecast for the 4-dimensional 
cosmological parameter space explored in this work. This 
will enable us to compare the filtering methods in a more 
realistic context, using marginalized errors and also cosmic 
microwave background (CMB) information. 

For the Planck Fisher matrix, we shall assume that the 
CMB temperature and polarization spectra can constrain 
9 parameters: the dark energy equation-of-state parameters 
Wo and Wa', the density parameter for dark energy Ode; the 
CDM and baryon density parameters scaled by the square 
of the dimensionless Hubble parameter ucdm = JIcdm^^ 
and Ub = ^Ibh^ {h = _ffo/[100 kms^^ Mpc^^]); the primor- 
dial spectral index of scalar perturbations Tig; the primor- 
dial amplitude of scalar perturbations As; the running of 
the spectral index a; and the optical depth to the last scat- 
tering suTfe£e_J^_^oco^l£U^ the CMB Fisher matrix we 
follow lEisenstein et al] (|l999l '): 



where {X, Y} e {TT, EE, TE, BB}, where Cz.tt is the tem- 
perature power spectrum, Cj^ee is the E-mode polarization 
power spectrum, Cj^te is the temperature-E-mode polariza- 
tion cross-power spectrum, and Ci^bb is the B-mode po- 
larization power spectrum. The assumed sky coverage is 
/sky = 0.8 In order to make the CMB Fisher matrix com- 
patible with our parameters, we rotate it to a new set 



{too, ^fa, fim, /l, fb,T, ris, (78, o} 



(33) 



I X.Y 



dC, 



dpa 



-COV-^ [Cl,X,Cl,Y 



dCi. 



dpb 



(32) 



where for us uiq = w. We marginalize over the 5 parameters 
absent from our analysis. 

Table |3] and Figure |6] represent the main results of 
this work, showing the overall improvement the hierarchical 
method brings over the fixed-size method after marginal- 
ization and especially after the inclusion of the CMB infor- 
mation. For the fixed-size method we choose the filter with 
M = 2 X lO'^^ h~^MQ, i.e. the best-performing filter among 
the fixed sizes that we have probed. We also show the un- 
marginalized errors, for a more complete picture. 

Combined with the CMB, the hierarchical errors are a 
factor of 2 better than the single-filter method for fim and 
(78, and almost a factor of 3 better for w. For Us there is 
no significant difference between the filtering methods. This 
happens because the CMB constrains the primordial power 
spectrum tighter than WL peak counts. 

We further depict these results in Figure |6] as 2a- 
ellipses; the blue dashed ellipses correspond to the single- 
sized method, and the red solid ones to the hierarchical al- 
gorithm. Here we see again that the latter really improves 
the joint constraints for {(T8,nm,U'}. 

It is difficult to make a comparison to previous fore- 
casts in the literature, as the probed parameter space and 
survey sp ecifications are not the same, so we shall mention 
only two. IWang et al.l ()2004h presented a forecast for LSST 
in which besides WL counts they also include the cluster 
power spectrum, which they treat as completely indepen- 
dent. They consider a larger parameter space than ours, 
including Wa and uji,, and assume ft = 65arcmin^'^ and 
(iS/7V)min = 4.5. The constraints that they find when com- 
bined with Planck priors (see Table 6 in their paper) are: 
Aris = 0.0022, AQue = 0.0033, Aa8 = 0.0037, Aw = 0.036. 
They use a Gaussian filter of size 1 arcmin, and their fidu- 
cial model has Q^n = 0.27, as = 0.9. The results are rather 
similar to ours, though their constraint of w is surprisingly 
tight, given the sensitivity of Aw to {S/Af)uiin - higher than 
ours - and the fact that they include Wa, known to degrade 
substantially the constraint on w. 

We also make a com parison to our previous work 
(IMarian fc Bernsteinir2006h . which uses the same type of 
normalized filter as this study. The detection threshold is 
5, the projection noise is accounted for, and instead of ris, 
Wa is consi dered. The fiducial values for Qui and ag are the 
same as in I Wang et al.l (|2004l ) ; combining with the Planck 
information, we found the constraints: Afim = 0.005, Aa8 = 
0.004, Aw = 0.063. These are in agreement with the results 
from the present study, given the above-mentioned differ- 
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Figure 6. Forecasted marginalized errors for WL peak counts from a Euclid-type of survey combined witii CMB constraints from 
Planck. The blue dashed ellipses depict the results of the fixed filter M = 2 x 10^^ h~^MQ, while the red solid ellipses correspond to the 
hierarchical method. In both cases, the WL Fisher matrix has been added to the Planck Fisher matrix, and the constraints are at the 
2 — (T level. 



Table 3. Fisher matrix constraints for the hierarchical method and a fixed filter of M = 2 x fO^^ h ^Mq. The fiducial values for the 
parameters are ns = 1, f!m = 0.25, erg = 0.8, w = —1. 



Hierarchical errors Fixed-size errors 











w 


ris 


r2ni 


o"8 


to 


Unmarginalized 


0.0025 


0.0006 


0.0015 


0.0125 


0.0064 


0.0008 


0.0019 


0.0128 


Marginalized 


0.0094 


0.0038 


0.0043 


0.0235 


0.0105 


0.0046 


0.0061 


0.0552 


Marginalized + CMB 


0.0039 


0.002 


0.003 


0.019 


0.004 


0.0045 


0.006 


0.0525 



5 SUMMARY AND CONCLUSIONS 

In this paper we proposed a new method, which we called 
'the hierarchical algorithm' to detect and explore WL peak 
counts. While previous studies h ave examined the benefits 
of using filters of a certain shape (jllennawi fc Spergej l2005l : 
iMaturi et "al]|2005l : iGruen et ai.ll201ll '). here we have focused 
on the way the filtering should be performed to maximize the 
inferred cosmological constraints. To this goal, we have used 
a large set of WL maps produced by ray-tracing through 
Ai'-body simulations with varying cosmological models, as 
described in section ^ 

Our method was based on the idea of sequential smooth- 
ing of the maps with filters of different size, from the largest 
to the smallest. The chosen filter was an aperture-mass fil- 
ter, matching the NFW density profile of halos. Combining 
the information contained in the maps smoothed on differ- 



ent scales, we determined the largest filter size for which a 
peak would not only be a point of local maximum, with a 
S/Af larger than a certain threshold, but also a match to the 
NFW profile of the filter. For the latter we have assumed a 
fixed redshift equal to the optimal redshift for lensing given 
the mean redshift of the source distribution. Under this as- 
sumption, we assigned a unique value of mass and S/J\f to 
the detected peaks, as described in detail in section ^ Thus, 
the peak function arising from the hierarchical method does 
not depend on a particular filter size. 

We compared the hierarchical peak abundance to that 
obtained from applying a filter of fixed size; for the latter 
we used the same aperture filter and considered three sizes 
{13.2, 8.6, 4.8} arcmin. At the assumed redshift Zm = 0.3, 
these correspond to halos with a Sheth-Tormen mass of 
M = {2 X 10^^5 X 10", 10"} /i^^M©. To quantify the ef- 
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ficiency of the smoothing methods, we took the Fisher ma- 
trix approach: we compared the errors on the cosmological 
parameters derived from each method. The considered pa- 
rameters were: {us, fim, crs,w}. Our findings are as follows: 

(i) The marginalized Fisher matrix errors obtained from 
the hierarchical peak abundance combined with CMB infor- 
mation from Planck were better by a factor of 2 compared 
to the results of the single-sized filtering. This was true if 
we took into account low peaks with S/J\f — 3; if we allowed 
only peaks with iS/A/" > 6 then the hierarchical errors were 
only marginally better. 

(ii) The three filters of fixed size yield very similar results, 
the largest being slightly more efi'ective. 

(iii) We have provided a cosmology forecast for WL peak 
counts relevant to future surveys like Euclid and LSST. 
Combined with information from a CMB experiment such 
as Planck, the hierarchical marginalized errors for the con- 
sidered parameters were: Ana = 0.0039 (0.004); Af2m = 
0.002 (0.0045); Aag = 0.003(0.006); Aw = 0.019 (0.0525), 
where the values in the parenthesis corresponded to the re- 
sults of the largest, fixed-size filter. Note that we have as- 
sumed no knowledge of the redshifts of the peaks, and yet 
have obtained values in a reasonable accord with analytical 
forecasts in the literature. 

(iv) The high-S /J\f and high-mass ends of the hierarchi- 
cal peak function were reasonably described by the Poisson 
distribution, e.g. Figures lAll IA2I IA5I IA6I since the hier- 
archical filtering successfully assigned the largest mass and 
S/M to the largest and rarest peaks. 

(v) The results of the Fisher matrix analysis had a slight 
dependence on the number of S/M bins used: The most 
suitable number of bins for the hierarchical method was 20. 

We have checked that the hierarchical method yields similar 
constraints if one bins the peak information in mass and not 
S/M , which is a reassuring consistency check. 

There are certain improvements that one could bring to 
the hierarchical method. First, the choice of filter shape: in 
this study, we have resorted to a filter which is optimal if 
one assumes the shape noise of galaxies as the main source 
of noise for WL measurements. Though this filter is also ef- 
fective in reducing the impact of correlated line-of-sight pro- 
jections for the measured peaks (Marian et al. 2009, 20l3), 
one could use a m ore sophisticated shape, as discussed in 
iGruen et all (l201ll ). Second, one should test the benefits of 
having more redshift information on the source galaxies, i.e. 
use tomography to improve the cosmological constraints de- 
rived from the peak abundance. We defer these issues to a 
future study. 

The main message conveyed by our work is that, com- 
pared to the standard approach of single-sized smoothing 
usually discussed in the literature, the hierarchical method 
extracts significantly more of the cosmological information 
enclosed in WL peak counts. Therefore, it will be a very 
useful tool for surveys like Euclid and LSST which have the 
potential to detect many thousands of peaks. 
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Mass [h ^Mq] 



Figure Al. The correlation coefficient r^j for the hierarchical 
peak abundance, binned in mass. The measurements are an av- 
erage of 128 fields of the fiducial model. 
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Figure A2. Same as in the previous figure, only the binning is 
in S/N. 
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Figure A3. The correlation coefficient for the fiducial peak abun- 
dance measured with a filter of fixed size Af = 2 x W^^h^^ Mq. 
The binning is in S/J\f. 
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Figure A4. The cS/A^-mass relation for the hierarchical (red solid 
circles) and single-sized filtering methods (blue solid squares). For 
the latter, we adopted the size Af = 2 X IQlS H-'^Mq. Only peaks 
detected through both methods are being compared. For both 
methods, the mass is the hierarchical aperture mass of Eq. I I13I1 . 
For the hierarchical method the relationship between the assigned 
mass and S/M of the peaks is very tight, with little scatter. For 
the single-sized method there is significantly more scatter, partic- 
ularly for the lower S/M bins. For clarity, we have slightly offset 
the position of the points on the x-axis. 



APPENDIX A: THE CORRELATION 
COEFFICIENT OF THE PEAK COUNTS 

For a covariance matrix C, the correlation coefficient is de- 
fined by 

Cij 



(Al) 



We explore the correlation coefficient of the peak abun- 
dance based on measurements from 128 fields of the fiducial 
model. Figure [XT] shows r for the hierarchical method, us- 
ing aperture mass bins. The mass is assigned according to 
the description given in section ijS] The result is very sim- 
ilar to the correlation coefficient of the 3D halo abundance 
l|Smith fc Marianll201ll ): There are strong correlations at the 
low-mass end (M < 3 x 10 Mq) while the large-mass 
bins are mostly uncorrelated. In Figure E2l we show the hier- 
archical results for r based on the same measurements, but 
using bins in S/J^ instead of mass. The same pattern as in 
the previous figure is visible, with the lower bins S/M sS 6 
having a correlation coefficient of — 0.4. Finally, Figure [XS] 
depicts r measured from the same fiducial maps, using a 
single filter of size M = 2 x lO^^/i^^M©. The correlations 
seem weaker than for the hierarchical case, but they extend 
to higher S/M, the highest bin being however largely uncor- 
related. This is further explained in the next figure. 

Figure [X4l presents the iS/A/'-mass scatter of peaks de- 
tected through the two methods. For the fiducial-cosmology 
peak abundances, we compare the coordinates of the peaks, 
to select only those peaks found through both methods. 
For these peaks, we concentrate on the following 3 quanti- 
ties: the hierarchical S/N given by Eq. (|14p . the single-sized 



S/N from Eq. (O, and the hierarchical aperture mass from 
Eq. (|13p . We consider S/N bins. For each bin we compute: 
the mean S/N of the hierarchical and single-sized peaks in 
that respective bin, as well as the mean mass, and the er- 
ror on the mass. We emphasize that for both methods, by 
mass we mean the aperture mass assigned through the hier- 
archical algorithm, i.e. Eq. H13[). The figure shows that both 
methods yield on average a similar relation between mass 
and S/N ■ However, in the hierarchical case, this relation is 
very tight: large-/small-mass peaks have large/small S/N, 
hence the similarity between the correlation matrices in Fig- 
ures ED and m For the single-filter method this is not the 
case, as the size of the error bars suggests that peaks with 
quite varying mass are binned in the same S/N bin. This is 
consistent with the correlation matrix shown in Figure IXSl 

In Figure Ksl we compare the fractional cumulative er- 
rors obtained using the full covariance matrix Eq. H27p to 
the Poisson errors Eq. H28p . for the two methods considered. 
The Poisson and fuU-covariance errors converge at the high- 
S/N end, in accord with the fact that the most massive 
peaks are assigned the largest S/N in both the hierarchi- 
cal and single-filter methods. Finally, in Figure [X6l we show 
the fractional and cumulative Fisher errors for the hierarchi- 
cal method, considering binning in mass, and not S/N. As 
already suggested by Figure lAll the Poisson statistic cap- 
tures reasonably well the high-mass end of the distribution 
of hierarchical peaks, where the fuU-covariance and Poisson 
errors converge. 

To conclude, the Poisson distribution can be used to 
approximate the likelihood function of high-S/N and high- 
mass hierarchical peaks. 



APPENDIX B: THE NUMBER OF BINS 

We now address the issue of the number of bins used to 
estimate the Fisher matrix in Eq. (|27p . We expect that too 
coarse a binning will diminish the constraints, as the infor- 
mation in the maps would not be fully captured. We also ex- 
pect the constraints to saturate once a large enough number 
of bins is considered. However, for all filtering methods we 
noticed a continuous improvement in the constraints with 
the increasing number of bins. This is most likely due to 
noise in the covariance matrix measurement. 

As a remedy, we apply the correction discussed by 
iHartlap et al.l (|2007l ). Given a data set drawn from a multi- 
variate Gaussian distribution, and given the maximum- 
likelihood estimator for the covariance matrix C, i.e. 
Eq. (|30p . then an unbiased estimator for the inverse covari- 
ance is: 

N - iVbi„ - 2 , 



c-^ = 



iV- 



(Bl) 



where A'^bin is the number of bins, and is the number of 
realizations - in our case the number of fiducial fields. We 
use logarithmically-spaced S/N hins in the interval [2.8, 14]. 
Estimating our Fisher matrix with the above equation alle- 
viated significantly the dependence of the constraints on the 
number of bins used, as seen in Figure iBll The figure shows 
the dependence of the unmarginalized Fisher errors on the 
number of bins in which the S/N interval is divided. All 
errors show little evolution with the number of bins, and are 
relatively stable once A^bin ~ 20, except for the case of ris. 
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Figure A5. Comparison between Poisson and full-covariance fractional unmarginalized errors, i.e. Egs l|28|l and For clarity, we 

show only the results for a single-sized filter of Af = 2 x 10^® H^^Mq — blue squares - and the hierarchical method - red circles. The full 
errors are the same as in Figure [S] 
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Figure A6. Comparison between the full-covariance and Poisson cumulative unmarginalized errors, with the information binned in 
terms of aperture mass, i.e. Eq. I I13I I. We show only results for the hierarchical method. The two errors converge at the high-mass end, so 
the hierarchical algorithm successfully assigns the highest masses to the most massive peaks, well described by the Poisson distribution. 
These results are similar to those shown in Figure lASl 
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Figure Bl. Dependence of the fractional unmarginalizcd errors on the number of S/M bins. We used Eq. ||B1| | to estimate the Fisher 
matrix Eq. II27I I. The hierarchical method performs best for a choice of A^bin = 20, while the single-filtering results seem largely independent 
of the number of bins used. 

where there is slightly more evolution. The entire analysis 
presented in this work was carried out for A^bin = 20. 
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